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Abstract. This paper applies a regularization procedure called increasing rearrange- 
ment to monotonize Edgeworth and Cornish-Fisher expansions and any other related 
approximations of distribution and quantilc functions of sample statistics. Besides sat- 
isfying the logical monotonicity, required of distribution and quantile functions, the pro- 
cedure often delivers strikingly better approximations to the distribution and quantile 
functions of the sample mean than the original Edgeworth-Cornish-Fisher expansions. 
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2 improving edgeworth approximations 

1. Introduction 

Higher order approximations to the distribution of sample statistics beyond the or- 
der n -1 / 2 provided by the central limit theorem are of central interest in the theory 
of asymptotic statistics, see e.g. (Blinnikov and Moessner 1998, Cramer 1999, Bhat- 
tacharya and Ranga Rao 1976, Hall 1992, Rothenberg 1984, van der Vaart 1998). An 
important tool for performing these refinements is provided by the Edgeworth expan- 
sion (Edgeworth 1905, Edgeworth 1907), which approximates the distribution of the 
statistics of interest around the limit distribution (often the normal distribution) by 
a combination of Hermite polynomials, with coefficients defined in terms of moments. 
Inverting the expansion yields a related higher order approximation, the Cornish-Fisher 
expansion (Cornish and Fisher 1938, Fisher and Cornish 1960), to the quantiles of the 
statistic around the quantiles of the limiting distribution. 

One of the important shortcomings of either Edgeworth or Cornish-Fisher expansions 
is that the resulting approximations to the distribution and quantile functions are not 
monotonic, which violates an obvious monotonicity requirement. This comes from the 
fact that the polynomials involved in the expansion are not monotone. Here we propose 
to use a procedure, called the rearrangement, to restore the monotonicity of the approx- 
imations and, perhaps more importantly, to improve the estimation properties of these 
approximations. The resulting improvement is due to the fact that the rearrangement 
necessarily brings the non-monotone approximations closer to to the true monotone 
target function. 

The main findings of the paper can be illustrated through a single picture given as 
Figure 1. In that picture, we plot the true distribution function of a sample mean X 
based on a small sample, a third order Edgeworth approximation to that distribution, 
and a rearrangement of this third order approximation. We see that the Edgeworth 
approximation is sharply non-monotone and provides a rather poor approximation to 
the distribution function. The rearrangement merely sorts the value of the approximate 
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Figure 1. Distribution function for the standarized sample mean of 
log-normal random variables (sample size 5), the third order Edgeworth 
approximation, and the rearranged third order Edgeworth approximation. 



distribution function in an increasing order. One can see that the rearranged approx- 
imation, in addition to being monotonic, is a much better approximation to the true 
function than the original approximation. 

We organize the rest of the paper as follows. In Section 2, we describe the rearrange- 
ment and qualify the approximation property it provides for monotonic functions. In 
section 3, we introduce the rearranged Edgeworth- Cornish-Fisher expansions and ex- 
plain how these produce better estimates of distributions and quantiles of statistics. In 
Section 4, we illustrate the procedure with several additional examples. 
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2. Improving Approximations of Monotone Functions by Rearrangement 

In what follows, let X be a compact interval. We first consider an interval of the form 
X = [0, 1]. Let f(x) be a measurable function mapping X to K, a bounded subset of R. 
Let = f x l{f(u) < y}du denote the distribution function of f(X) when X follows 

the uniform distribution on [0, 1]. Let 



This function /* is called the increasing rearrangement of the function /. The rearrange- 
ment is a tool that is extensively used in functional analysis and optimal transportation 
(see e.g. Hardy, Littlewood, and Polya (1952) and Villani (2003).) It originates in the 
work of Chebyshev, who used it to prove a set of inequalities (Bronshtein, Semendyayev, 
Musiol, Muehlig, and Miihlig 2004). Here we use this tool to improve approximations 
of monotone functions, such as the Edgeworth-Cornish-Fisher approximations to the 
distribution and quantile functions of the sample mean. 

The rearrangement operator simply transforms a function / to its quantile function 
/*. That is, x h- > f*(x) is the quantile function of the random variable f(X) when 
X ~ U(0,1). Another convenient way to think of the rearrangement is as a sorting 
operation: Given values of the function f(x) evaluated at x in a fine enough mesh of 
equidistant points, we simply sort the values in an increasing order. The function created 
in this way is the rearrangement of /. 

Finally, if X is of the form [a, b], let x(x) = (x — a)/(b — a) G [0, 1], x(x) = a+(b — a)x G 
[a, b], and /* be the rearrangement of the function f(x) = f(x(x)) defined on X = [0, 1]. 
Then, the rearrangement of / is defined as 



f*( x ) .= Q f ( x ) .= M{y G R : F f (y) > x} 



be the quantile function of Ff(y). Thus, 




f*(x) := f*(x(x)). 
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The following result is detailed in Chernozhukov, Fernandez- Val, and Galichon (2006). 



Proposition 1 (Improving Approximation of Monotone Functions). Let /q : X = 

[a, b] —>■ K be a weakly increasing measurable function in x, where K is a bounded 
subset of R. This is the target function that we want to approximate. Let f : X — > K 
be another measurable function, an initial approximation or an estimate of the target 
function f . 

1. For any p G [1, oo], the rearrangement of f , denoted f*, weakly reduces the estimation 
error: 

r r ^ v iVp r r ^ v 1 l l p 

(2.1) 
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2. Suppose that there exist regions Xq and Xq, each of measure greater than 5 > 0, such 
that for all x G Xq and x' G Xq we have that (i) x' > x, (ii) f(x) > f(x') + e, and (Hi) 
fo(x') > fo(x) + e, for some e > 0. Then the gain in the quality of approximation is 
strict for p G (1, oo). Namely, for any p G [1, oo], 
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f(x) - fo(x) 



dx — b~x% 



(2.2) 



where 1] P = inf {\v — t'\ p + \v' — t\ p — \v — t\ p — \v' — t'\ p } and 7] p > for p G (1, oo), with 
the infimum taken over all v, v', t, t' in the set K such that v ' > v + e and t' > t + e; and 
5 x = 5/(b-a). 

Corollary 1 (Strict Improvement). If the target function fo is increasing over X and 
f is decreasing over a subset of X that has positive measure, then the improvement in 
L p norm, forp G (l,oo), is necessarily strict. 



The first part of the proposition states the weak inequality (12.11) . and the second 
part states the strict inequality ( 12. 21) . As an implication, the Corollary states that the 
inequality is strict for p G (1, oo) if the original estimate f{x) is decreasing on a subset 
of X having positive measure, while the target function fo(x) is increasing on X (by 
increasing, we mean strictly increasing throughout). Of course, if fo(x) is constant, then 



6 



IMPROVING EDGEWORTH APPROXIMATIONS 



the inequality (12 .11) becomes an equality, as the distribution of the rearranged function 
/* is the same as the distribution of the original function /, that is Fj, = Fj. 

This proposition establishes that the rearranged estimate /* has a smaller estimation 
error in the L p norm than the original estimate whenever the latter is not monotone. 
This is a very useful and generally applicable property that is independent of the sample 
size and of the way the original approximation / to f is obtained. 

Remark 1. An indirect proof of the weak inequality \2. 1\) is a simple but important 
consequence of the following classical inequality due to Lorentz (1953): Let q and g be 
two functions mapping X to K , a bounded subset of R. Let q* and g* denote their 
corresponding increasing rearrangements. Then, 



for any submodular discrepancy function L : R 3 > R. Set q(x) = f{x), q*(x) = f*(x), 
g(x) = fo(x), and g*(x) = fo(x). Now, note that in our case fo(x) = fo(x) almost 
everywhere, that is, the target function is its own rearrangement. Moreover, L(v,w,x) = 
\w — v \ p is submodular for p G [1, oo) . This proves the first part of the proposition above. 
For p = oo, the first part follows by taking the limit as p —>■ oo. In the Appendix, 
for completeness, we restate the proof of Chernozhukov, Fernandez- Vol, and Galichon 
(2006) of the strong inequality k2. 2\) as well as the direct proof of the weak inequality 



Remark 2. The following immediate implication of the above finite-sample result is also 
worth emphasizing: The rearranged estimate f* inherits the L p rates of convergence from 
the original estimates f. For p £ [l,oo], if X n = [J x \ fo{x) — f{x)\ p du] l l p = Op(a n ) for 
some sequence of constants a n , then [j x \fo(x) — f*(x)\ p du] 1 ^ p < X n = Op(a n ). However, 
while the rate is the same, the error itself is smaller. 




L(q*(x),g*(x),x)dx < / L(q(x), g(x), x)dx, 




Remark 3. Finally, one can also consider weighted rearrangements that can accentuate 
the quality of approximation in various areas. Indeed, consider an absolutely continuous 
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distribution function A on X = [a,b], then we have that for (/ o A denoting the 

rearrangement of u i— > /(A _1 (-u)) 

i/p 



[0,1] 



(/oA-^W-MA- 1 ^)) 
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1/p 
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^[0,1] 



/(A- 1 ( M ))-/o(A~ 1 H) 



du 



(2.3) 



or, equivalently, by a change of variable, setting 

Ti{x) = (fo^y(x( x )) 

we have that 



x 



fl(x)-f (x) dA(x) 



i i/p 



< 



X 



f(x)-f (x) dA(x 



i Vp 



(2.4) 



Thus, the function x i— > f^(x) is the weighted rearrangement that provides improvements 
in the quality of approximation in the norm that is weighted according to the distribution 
function A. 

In the next section, we apply rearrangements to improve the Edgeworth- Cornish- 
Fisher and related approximations to distribution and quantile functions. 

3. Improving Edgeworth-Cornish-Fisher and Related expansions 

3.1. Improving Quantile Approximations by Rearrangement. We first consider 
the quantile case. Let Q n be the quantile function of a statistic X n , i.e. 

Q n (u) = M{x G R : Pr[X n < x] > u}, 

which we assume to be strictly increasing. Let Q n be an approximation to the quantile 
function Q n satisfying the following relation on an interval U n = [e n , 1 — e n ] C [0, 1]: 



Qn{u) = Qn{u) + e n (u), \e n (u)\ < a n , for all u e U n . 



(3.1) 



The leading example of such an approximation is the inverse Edgeworth, or Cornish- 
Fisher, approximation to the quantile function of a sample mean. If X n is is a stan- 
dardized sample mean, X n = ^2™ =1 (Yi — E[Yi\) / \fV ' ar '(V^) based on an i.i.d. sample 
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(Y"i, Y n ), then we have the following J-th order approximation 

Qn(u) = Q n (u) + e n (u) 

Q n {u) = Rii^iu)) + R 2 (^>- 1 (u))/n 1 / 2 + ... + i? J ($- 1 («))/n (J - 1)/2 ) 

(3.2) 

\e n (u)\ < Cn J/2 , for all ueU n = [e n , 1 - e n ], 

for some e n \ and C > 0, 

provided that a set of regularity conditions, specified in, e.g., Zolotarev (1991), hold. 
Here $ and $ _1 denote the distribution function and quantile function of a standard 
normal random variable. The first three terms of the approximation are given by the 
polynomials, 

Ri{z) = z, 

R 2 {z) = \{z 2 - l)/6, (3.3) 

R 3 (z) = (3k(z 3 - 3z) - 2\ 2 (2z 3 - 5z))/72, 

where A is the skewness and k is the kurtosis of the random variable Y. The Cornish- 
Fisher approximation is one of the central approximations of the asymptotic statistics. 
Unfortunately, inspection of the expression for polynomials (13.31) reveals that this ap- 
proximation does not generally deliver a monotone estimate of the quantile function. 
This shortcoming has been pointed and discussed in detail e.g. by Hall (1992). The 
nature of the polynomials constructed is such that there always exists a large enough 
range U n over which the Cornish-Fisher approximation is not monotone, cf. Hall (1992). 
As an example, in the case of the second order approximation (J = 2) we have that for 
A < 

Qn(u) \ — oo, as u / 1, (3.4) 

that is, the Cornish- Fisher "quantile" function Q n is decreasing far enough in the tails. 
This example merely suggests a potential problem that may apply to practically relevant 
ranges of probability indices u. Indeed, specific numerical examples given below show 
that in small samples the non-monotonicity can occur in practically relevant ranges. Of 
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course, in sufficiently large samples, the regions of non-monotonicity are squeezed quite 
far into the tails. 

Let Q* n be the rearrangement of Q n . Then we have that for any p G [1, oo], the rear- 
ranged quantile function reduces the approximation error of the original approximation: 

i/p 



Qn( u ) ~ Qn(u) 



V 


i/p 




du 


< 


II 









Qn(u) ~ Qn(u) 



du 



< (1 - 2e n )a n , (3.5) 



with the first inequality holding strictly for p G (1, oo) whenever Q n is decreasing on a 
region of U n of positive measure. We can give the following probabilistic interpretation 
to this result. Under condition ( 13. II) . there exists a variable U = F n (X n ) such that both 
the stochastic expansion 



X n = Q n (U) + O p {a n ), 



(3.6) 



and the expansion 



X n = Q* n (U) + O p (a. 



(3.7) 



but the variable Q* n {U) in (13. 7p is a better coupling to the statistic X n than Q n (U) 
in (13.61) . in the following sense: For each p G [1, oo], 

[El n [X n - Q* n (U)] p ] 1/p < [El n [X n - Q n (U)} p } 1/p , (3.8) 



where 1„ = 1{U G U n }. Indeed, property (13.81) immediately follows from (13.51) . 

The above improvements apply in the context of the sample mean X n . In this case, 
the probabilistic interpretation above is directly connected to the higher order central 
limit theorem of Zolotarev (1991), which states that under (13.21) . we have the following 
higher-order probabilistic central limit theorem, 



X n = Q n {U) + O p {n- J ' 2 ). 



(3.9) 



1 Qn(U) is defined only on U n , so we can set Qn(U) — Q n (U) outside U n , if needed. Of course, 
U $ lA n with probability going to zero. 
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The term Q n (U) is Zolotarev's high-order refinement over the first order normal terms 
$ _1 (i7). Sun, Loader, and McCormick (2000) employ an analogous higher-order proba- 
bilistic central limit theorem to improve the construction of confidence intervals. 

The application of the rearrangement to the Zolotarev's term in fact delivers a clear 
improvement in the sense that it also leads to a probabilistic higher order central limit 
theorem 

X n = Q* n (U)+O p (n- J / 2 ), (3.10) 

but the leading term Q* n (U) is closer to X n than the Zolotarev's term Q n (U), in the 
sense of (13.81) . 

We summarize the above discussion into a formal proposition. 

Proposition 2. If expansion $3. 1\) holds, then the improvement [3. 5\) necessarily holds. 
The improvement is necessarily strict if Q n is decreasing over a region ofU n that has a 
positive measure. In particular, this improvement property applies to the inverse Edge- 
worth approximation defined in I13.S\) to the quantile function of the sample mean. 

3.2. Improving Distributional Approximations by Rearrangement. We next 
consider distribution functions. Let F n (x) be the distribution function of a statistic 
X n , and F n (x) be the approximation to this distribution such that the following relation 
holds: 

F n (x) = F n (x) + e n (x), \e n (x)\ < a n , for all x G X m (3.11) 

where X n = [—b n , b n ] is an interval in R for some sequence of positive numbers b n possibly 
growing to infinity. 

The leading example of such an approximation is the Edgeworth expansion of the 
distribution function of a sample mean. If X n is a standardized sample mean, X n = 
Y^=i0^i ~~ F[Yi] ) I y/VariXi) based on an i.i.d. sample (Y\, Y n ), then we have the 
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following J-th order approximation 

F n (z) = F n (z) + e n (z) 

F n {x) = P 1 {x) + P 2 (x)/n l l 2 + ... + Pj^/nW 2 , 
\tn(x)\ < Cn~ J/2 , for all x G X n , 
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(3.12) 



for some C > 0, provided that a set of regularity conditions, specified in e.g. Hall (1992), 
hold. The first three terms of the approximation are given by 



Pi(x) 

P2(X) 

Psix) 



<$>{x), 

-\(x 2 - l)(f>{x)/6, (3.13) 
-(3k(z 3 - 3x) + A 2 (x 5 - 10x 3 + 15x))0(x)/72, 

where $ and <j) denote the distribution function and density function of a standard 
normal random variable, and A and k are the skewness and kurtosis of the random 
variable Y . The Edgeworth approximation is one of the central approximations of the 
asymptotic statistics. Unfortunately, like the Cornish- Fisher expansion it generally does 
not provide a monotone estimate of the distribution function. This shortcoming has 
been pointed and discussed in detail by (Barton and Dennis 1952, Draper and Tierney 
1972, Sargan 1976, Balitskaya and Zolotuhina 1988), among others. 

Let F* be the rearrangement of F n . Then, we have that for any p G [l,oo], the 
rearranged Edgeworth approximation reduces the approximation error of the original 
Edgeworth approximation: 

i/p 





V 


i/p 




F:(x)-F n (x) 


dx 


< 













FJx) - FJx) 



dx 



(3.14) 



with the first inequality holding strictly for p G (1, oo) whenever F n is decreasing on a 
region of X n of positive measure. 



Proposition 3. If expansion $3.11\) holds, then the improvement jjj3.14\ ) necessarily 
holds. The improvement is necessarily strict if F n is decreasing over a region of X n 
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that has a positive measure. In particular, this improvement property applies to Edge- 
worth approximation defined in (3.12) to the distribution function of the sample mean. 



3.3. Weighted Rearrangement of Cornish-Fisher and Edgeworth Expansions. 

In some cases, it could be worthwhile to weigh different areas of support differently 
than the Lebesgue (flat) weighting prescribes. For example, it might be desirable to 
rearrange F n using F n as a weight measure. Indeed, using F n as a weight, we obtain a 
better coupling to the P- value: P = F n (X n ) (in this quantity, X n is drawn according 
to the true F n ). Using such weight will provide a probabilistic interpretation for the 
rearranged Edgeworth expansion, in analogy to the probabilistic interpretation for the 
rearranged Cornish- Fisher expansion. Since the weight is not available we may use the 
standard normal measure $ as the weight measure instead. We may also construct an 
initial rearrangement with the Lebesgue weight, and use it as weight itself in a further 
weighted rearrangement (and even iterate in this fashion). Using non-Lebesgue weights 
may also be desirable when we want the improved approximations to weight the tails 
more heavily. Whatever the reason might be for further non-Lebesgue weighting, we 
have the following properties, which follow immediately in view of Remark 3. 

Let A be a distribution function that admits a positive density with respect to the 
Lebesgue measure on the region U n = [e n , 1 — e n ] for the quantile case and region X n = 
[—b n , b n ] for the distribution case. Then if (13.11) holds, the A-weighted rearrangement 
Q* n A of the function Q satisfies 



Q*n a( U ) ~ Qn{u) dA(u 



1/p 



< / Qn(u)-Q n (u) dA(u) 

< (A[l - e n ) - A[e„])a f 



i/p 



(3.15) 
(3.16) 



where the first equality holds strictly when Q is decreasing on a subset of positive A- 
measure. Furthermore, if (13.111) holds, then the A-weighted rearrangement F* A of the 
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function F satisfies 



x n 







i/p 






p 


i/p 


K A {x) - F n (x) 


P dA{x) 


< 




F n (x)-F n (x) 


dA(x) 

















< (A[6 n ] - A[-6 n ])a n . 



(3.17) 
(3.18) 



4. Numerical Examples 

In addition to the lognormal example given in the introduction, we use the gamma dis- 
tribution to illustrate the improvements that the rearrangement provides. Let (Yi, Y n ) 
be an i.i.d. sequence of Gamma(l/16,16) random variables. The statistic of interest is 
the standardized sample mean X n = ^ILiO'i — E\Yi\)/ \/V^ar(Fj). We consider samples 
of sizes n — 4, 8, 16, and 32. In this example, the distribution function F n and quantile 
function Q n of the sample mean X n are available in a closed form, making it easy to com- 
pare them to the Edgeworth approximation F n and the Cornish-Fisher approximation 
Q n , as well as to the rearranged Edgeworth approximation F* and the the rearranged 
Cornish-Fisher approximation Q* n . For the Edgeworth and Cornish-Fisher approxima- 
tions, as defined in the previous section, we consider the third order expansions, that is 
we set J = 3. 

Figure [2] compares the true distribution function F n , the Edgeworth approximation 
F n , and the rearranged Edgeworth approximation F*. We see that the rearranged Edge- 
worth approximation not only fixes the monotonicity problem, but also consistently does 
a better job at approximating the true distribution than the Edgeworth approximation. 
Table [1] further supports this point by presenting the numerical results for the L p ap- 
proximation errors, calculated according to the formulas given in the previous section. 
We see that the rearrangement reduces the approximation error quite substantially in 
most cases. 

Figure [3] compares the true quantile function Q n , the Cornish-Fisher approximation 
Q n , and the rearranged Cornish-Fisher approximation Q n . Here too we see that the 
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rearrangement not only fixes the non-monotonicity problem, but also brings the approx- 
imation closer to the truth. Table [2] further supports this point numerically, showing 
that the rearrangement reduces the L p approximation error quite substantially in most 
cases. 

5. Conclusion 

In this paper, we have applied the rearrangement procedure to monotonize Edgeworth 
and Cornish-Fisher expansions and other related expansions of distribution and quantile 
functions. The benefits of doing so are twofold. First, we have obtained estimates 
of the distribution and quantile curves of the statistics of interest which satisfy the 
logical monotonicity restriction, unlike those directly given by the truncation of the 
series expansions. Second, we have shown that doing so resulted in better approximation 
properties. 

Appendix A. Proof of Proposition 1 

We consider the case where X = [0, 1] only, as the more general intervals can be dealt 
similarly. The first part establishes the weak inequality, following in part the strategy 
in Lorentz's (1953) proof. The proof focuses directly on obtaining the result stated in 
the proposition. The second part establishes the strong inequality. 

Proof of Part 1. We assume at first that the functions /(•) and /o(-) are simple 
functions, constant on intervals ((s — l)/r, s/r], s — 1, ...,r. For any simple /(•) with r 
steps, let / denote the r- vector with the s-th element, denoted f s , equal to the value of 
/(■) on the s-th interval. Let us define the sorting operator S(f) as follows: Let I be an 
integer in 1, r such that fg > f m for some m> I. If £ does not exist, set S(f) = f. If 
i exists, set S(f) to be a r- vector with the £-th element equal to f m , the m-th element 
equal to fg, and all other elements equal to the corresponding elements of /. For any 
submodular function L : R 2 — > K + , by fi > f m , f 0m > f £ and the definition of the 
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submodularity, 

L(f m , foi) + L(ft, fom) < L(ft, foi) + L(f m , fom). 

Therefore, we conclude that 



L(S(f)(x),f (x))dx< / L(f(x),f (x))dx, (A.l) 
x Jx 

using that we integrate simple functions. 

Applying the sorting operation a sufficient finite number of times to /, we obtain a 
completely sorted, that is, rearranged, vector /*. Thus, we can express /* as a finite 
composition /* = S o ... o S(f) . By repeating the argument above, each composition 
weakly reduces the approximation error. Therefore, 

L(f(x)J (x))dx< [ L(^.^S(f)J (x))dx< [ L(f(x)J (x))dx. (A.2) 
x Jx TT^C J x 

finite times 

Furthermore, this inequality is extended to general measurable functions /(•) and /o(-) 
mapping X to K by taking a sequence of bounded simple functions /^(-) and f$\-) 
converging to /(■) and f (-) almost everywhere as r — > oo. The almost everywhere 
convergence of f^(-) to /(•) implies the almost everywhere convergence of its quantile 
function /*^(-) to the quantile function of the limit, /*(•). Since inequality flA.2j) holds 
along the sequence, the dominated convergence theorem implies that flA.20 also holds 
for the general case. □ 

Proof of Part 2. Let us first consider the case of simple functions, as defined in Part 
1. We take the functions to satisfy the following hypotheses: there exist regions X and 
Xq, each of measure greater than 5 > 0, such that for all x e X and x' G Xq, we have 
that (i) x' > x, (ii) f(x) > f(x') + e, and (iii) fo(x') > fo(x) + e, for e > specified in 
the proposition. For any strictly submodular function L : R 2 — ► R + we have that 

rj = inf{L(v', t) + L(v, t') - L{v, t) - L(v', t')} > 0, 

where the infimum is taken over all v, v', t, t' in the set K such that v' > v + e and 
a > t + e. 
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We can begin sorting by exchanging an element f(x), x e X , of r- vector / with an 
element f(x'), x' G Xq, of r-vector /. This induces a sorting gain of at least r] times 1/r. 
The total mass of points that can be sorted in this way is at least 5. We then proceed to 
sort all of these points in this way, and then continue with the sorting of other points. 
After the sorting is completed, the total gain from sorting is at least 5r]. That is, 

/ L(f*(x),f (x))dx< [ L(f(x),f (x))dx-6r). 

J X J X 

We then extend this inequality to the general measurable functions exactly as in the 
proof of part one. □ 
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Table 1. Estimation errors for approximations to the distribution func- 
tion of the standarized sample mean from a Gamma(l/16, 16) population. 
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Table 2. Estimation errors for approximations to the distribution func- 
tion of the standarized sample mean from a Gamma(l/16, 16) population. 
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Figure 2. Distribution Functions, First Order Approximations, Third 
Order Approximations, and Rearrangements for the standarized sample 
mean from a Gamma(l/16, 16) population. 




Figure 3. Quantile Functions, First Order Approximations, Third Order 
Approximations, and Rearrangements for the standarized sample mean 
from a Gamma(l/16, 16) population. 



